Fuzzy Imputation Method for Database Systems

نویسندگان

  • José Ignacio Peláez
  • Jesús M. Doña
  • David La Red
چکیده

The missing data and nonresponse problem is a usual difficulty of particular concern in medical and social science databases. Dealing with nonresponse can be a difficult matter and it is important to apply adequate missing data methods to obtain valid inference. Missing data is a very common problem in real data sets, and different methods to solve this problem have been developed. A simple and common strategy is to ignore missing values, thus reducing the size of the useful data set. The experience in databases has demonstrated the dangers of simply removing cases (listwise deletion) from the original data set, and deletion can introduce AbstrAct

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Towards Missing Data Imputation: A Study of Fuzzy K-means Clustering Method

In this paper, we present a missing data imputation method based on one of the most popular techniques in Knowledge Discovery in Databases (KDD), i.e. clustering technique. We combine the clustering method with soft computing, which tends to be more tolerant of imprecision and uncertainty, and apply a fuzzy clustering algorithm to deal with incomplete data. Our experiments show that the fuzzy i...

متن کامل

Microsoft Word - ICAME09_opti_leslabay_final

There are many situations where input feature vectors are incomplete and methods to tackle the problem have been studied for a long time. A commonly used procedure is to replace each missing value with an imputation. This paper presents a method to perform categorical missing data imputation from numerical and categorical variables. The imputations are based on Simpson’s fuzzy min-max neural ne...

متن کامل

Microsoft Word - 5_.rtf

There are many situations where input feature vectors are incomplete and methods to tackle the problem have been studied for a long time. A commonly used procedure is to replace each missing value with an imputation. This paper presents a method to perform categorical missing data imputation from numerical and categorical variables. The imputations are based on Simpson’s fuzzy min-max neural ne...

متن کامل

Microsoft Word - Pilar Rey-del-Castillo.rtf

There are many situations where input feature vectors are incomplete and methods to tackle the problem have been studied for a long time. A commonly used procedure is to replace each missing value with an imputation. This paper presents a method to perform categorical missing data imputation from numerical and categorical variables. The imputations are based on Simpson’s fuzzy min-max neural ne...

متن کامل

On a Fuzzy c-means Algorithm for Mixed Incomplete Data Using Partial Distance and Imputation

The focus of fuzzy c-means clustering method is normally used on numerical data. However, most data existing in databases are both categorical and numerical. To date, clustering methods have been developed to analyze only complete data. Although we sometimes encounter data sets that contain one or more missing feature values (incomplete data), traditional clustering methods cannot be used for s...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008